Skip to content

[SYSTEMDS-3367] Integrate UDF encoders in task graph#1623

Closed
phaniarnab wants to merge 1 commit into
apache:mainfrom
phaniarnab:tfnorm1
Closed

[SYSTEMDS-3367] Integrate UDF encoders in task graph#1623
phaniarnab wants to merge 1 commit into
apache:mainfrom
phaniarnab:tfnorm1

Conversation

@phaniarnab
Copy link
Copy Markdown
Contributor

This patch integrates UDF-based transformencoders into the
task graph to allow concurrent execution. As we cannot estimate
the sparsity of an arbitrary UDF output, we always allocate
dense output matrix if at least one UDF is present. If an UDF
comes after a dummycode, we slice the expanded columns and
apply the UDF. Moreover, we disable row partitioning for UDFs.

This patch integrates UDF-based transformencoders into the
task graph to allow concurrent execution. As we cannot estimate
the sparsity of an arbitrary UDF output, we always allocate
dense output matrix if at least one UDF is present. If an UDF
comes after a dummycode, we slice the expanded columns and
apply the UDF. Moreover, we disable row partitioning for UDFs.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant